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Indian Standard
ANALYSIS OF VARIANCE
0. FOREWORD
0.1 This Indian Standard was adopted by the Indian Standards Institution
on 10 February 1975, after the draft finalized by the Quality Control and Industrial Statistics Sectional Committee had been approved by the Executive Committee.

0.2 The technique of analysis of variance is an indispensable tool in the scientific and industrial research for the analysis of experimental data involving quantitative measurements and is particularly helpful when several independent sources of variation are present in the data such as the results obtained at different temperatures, duplicate determinations of the same material made by several analysts, measurements classified according to several sources of supply of raw material obtained from different vendors, etc. 0.3 It is well-known that the observations obtained by repetitive experiThe source of variation in the data may be ments vary among themselves. Using the analysis of variance due to various causes, assignable or chance. techniques it is possible to estimate how much of the total variation in a set of data can be attributed to one or more assignable causes of variation, the remainder which is not attributable to any assignable causes of variation being classed as due to chance causes which produces the residual or error variation, 0.4 This standard is a sequel to the `Indian Standard on Statistical Tests To compare the means of two groups of of Significance' (IS : 6200-1971). observations and to assess whether the difference between them can be reasonably ascribed to chance, the t-test is used. When the comparison is to be made-among the means of more than two groups of observations, resort to the analysis of variance technique is made. In fact, -t-test is a particular case of analysis of variance. 0.5 In reporting the result of a test or analysis, if the final value, observed or calculated, is to be rounded off, it shall be done in accordance with IS : 2-1960*.

1. SCOPE 1.1 This standard intends to give a brief outline of the general treatment of the analysis of variance technique with respect to some of the designs
*Rules for rounding off numerical values (r&d).

3

IS : 7666-

1975

The which are more frequently used in industrial experimentation. techniques have been illustrated with examples wherein the necessary computational details have also been given. 2. TERMINOLOGY 2.0 For the purpose of this standard, the following definitions shall apply. 2.1 Standard Deviation - The square root of the quotient obtained by dividing the sum of squares of deviations of the observations from their mean by one less than the number of observations in the sample. 2.2 Variance Square of standard deviation. com-

2.3 Degrees of Freedom (D. F.) - The number of independent ~ponent values which are necessary to determine a statistic.

2.4 Null Hypothesis - The hypothesis (or assumption) of the equivalence (or no difference) among the effects of methods so that the samples emanate from the same lot. 2.5 Level of Significance - The probability (or risk) of rejecting the null hypothesis when it is true. Conventionally, it is taken to be 5 percent or 1 percent. 2.6 Corrected Sum of Squares - The total of the squares of the deviations of the observations from their mean. 3. SOME BASIC CONCEPTS Model to any matheunder model,

3.1 Mathematical

3.1.1 Before the application of the analysis of variance techniques experimental data, it is fundamental to have some knowledge of the matical model holding good for the particular investigation consideration. Basically there are three models, namely, fixed effects random-effects model and mixed effects model (see also Appendix A).

3.1.2 In the fixed effects model, some of the assignable causes of variation in the experiment are deliberately chosen so that the results of the analysis are not amenable for generalization in that direction. In other words, when the effects are unknown constants (parameter) the model is called fixed effects model or model I. For example, in an inter-laboratory investigation on the checking of the precision of tensile testing machine, four laboratories possessing a particular brand of the machine may be intentionally chosen so that the results of the analysis will apply only to these four laboratories and any conclusion derived will not be applicable to all the laboratories in general. On the other hand, if the tensile testing machines of a large number of laboratories are to be investigated and due to limitations of facilities, the four laboratories chosen are a random sample of all the laboratories, then 4
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themathematical model chosen is that of a random effects model or model II. The mixed effects model, as the name itself indicates, is the combination of the earlier two models where some of the effects are of the fixed nature, the remaining being of random nature. The random effects model has extensive applications in most of the industrial experimentation. 3.2 Additive Nature of Sum of Squares 33.1 As the name implies the technique of analysis of variance consists in separating the total variance into parts, each part measuring the variability attributable to some specific source. For example in an inter-laboratory testing there may be variability among laboratories and variability within the laboratory. The latter, in turn, may be composed of a number of components of ~differing magnitude and importance like the variability among different analysts, among days, or among determinations made on the same day. Using the additive property of the varcance the total variation between members of a set of observations, classified according to one or more criteria, c%an be broken up into components, attributable to different criteria of classification which are of experimental interest or importance. By testing the significance of these components it is possible to determine which of the criteria are associated with a significant proportion of the overall variability in the averages. 3.2.2 Planned experiments have proved to be very powerful and economical in investigating the influence of various factors contributing to the total variance in the measurable characteristics of the product. In these, measurements are taken on a sample of units, which is so constituted as to ensure the simultaneous randomization in experimental error and variation due to changes in treatments. As long- as they are clearly defined and reproducible these treatments may represent different materials or temperatures or processes or any variation in operating conditions. 3.3 Orthogonagty
3.3.1 For of Designs

the easy statistical analysis of the data resulting from any investigation, a desirable feature of the design adopted is its orthogonality. Orthogonality ensures that the different classes of effects shall be capable of direct and separate estimation without any entanglement. 3.3.2 For example, in an experiment to observe the effect of varying two factors, say temperature and pressure, wherein the temperature is measured at four levels and pressure at three levels, there will be 12 experimentat conditions generated by taking each level of temperature with each level of The comparison between the average of three results at any two pressure. temperatures will then be purely a measure of the effect of temperature (and vice versa) since the averages will have been taken over the same set of pressures. The two factors are said to be mutually orthogonal. Now by accident or design suppose that the combination corresponding to the ~highest temperature and pressure is missing from the experiment, 5
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then the difference between the averages of the observations at the lowest and the highest temperatures could not be purely a measure of the effect of temperature because the first average is based on observations at three levels of pressure whereas the second observation is based on the average at only two levels of pressure and hence the difference between the two averages will be, to an unknown extent, influenced by the effect of pressure. The experiment in this case is non-orthogonal. 3.4 The details of the application of the technique of analysis of variance will vary with the number of independent causes of variation. It is possible to class@ the data with respect to each independent source ofvariation and the complete classification is a necessary first step to the application of analysis of variance. The following sections describe the procedure for analysis of variance under different categories which are commonly met in practice. 4. ANALYSIS

CLASSIFICATION OF DATA

OF VARIANCES

SINGLE

FACTOR

OR ONE-WAY

4.1 Replicate Determinations 4.1.1 Data will frequently be encountered wher%e classification is based on one factor only, for example analysts or temperatures or batches of material. The data may consist of (a) replicate determinations of the same material made by several analysts in the same laboratory or (b) measurements obtained at different temperatures or (c) measurable characteristics of some material obtained in different lots or shipment from the same supplier, etc. 4.1.2 In all these, there will be variation withinreplicates (unassignable There will variation in the system) which is a measure of the precision. also be a variation in means of results obtained under different conditions. It is due to the differences among analysts or differences in temperatures or lot-to-lot variation of the shipment. Analysis of variance helps to separate these effects and to determine whether there is any significant difference between operators or temperatures or shipments, as the case may be. 4.1.3 Usually, in the interest of experimental efficiency and simplicity of analysis, it is desirable to have the same number of replicate determinations for each class. But owing to a lack of design or to the loss ofpart of data or natural grouping of experimental material or perhaps due to deliberate placing of emphasis on certain effects, the number of observations in various The following two examples illustrate the method classes may be unequal. of analysis when there are (a) unequal number of observations in each class, and (b) equal number of observations in each class. following data represent the warpway breaking 4.1.4 Example 1 -The strength of Type II Indian hessian [see IS : 2818 (Part II)-1971*] measured
*Specification for Indian h&an: (jfs: wi.Go~). Part II 305 and 229 g/m* at 16 percent contract regain
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in unitsof kg. Products of three different mills A, B and C were tested and the test results are reproduced below:
Mill B" C WarpruayBreakingStrength (kg) 87,96,99,94,91 93,99,88, 2 , l",$, ,
Total

No. of Obmvntions
ZI

TOM

MN?2 93.4 94-6 90.0

loo,91

3
16

467 757 270
1494

The various calculations needed for forming the required analysis of variance are as follows : . . . (84)s= 139 940 4 Uncorrected total sum of squares : (87)*+(96)*+.

b)

Uncorrected

sum of squares between mills : :

(1 w* -=139 502.25 16 : (a)-(c) =437*75 d) Total sum of squares 4 Sum of squares between mills : (b) -(c) =46.68 f > Sum of squares within mills : (d) - (e)= 391.07 The degrees of freedom corresponding to the various entries of the analysis of variance table are obtained as follows : Degrees of freedom for total sum of squares=Total number of observations - 1 = 16 - 1 =15 Degrees of freedom for between mills=Total number of mills- 1 =3-_1=2 Degrees of freedom for \vithin mills=Total degrees of freedomdegrees of freedom for between mills=15-2213

c>Correction

factor

ANALYSIS OF VARIANCETABLE

sumof
Squares
Between mills Within mills Total

Mean Square 23.34 30.08 s

F Ratio

2 13
15

46.68
391.07 437.75

zO.78

To test the hypothesis that there is no appreciable variation between the average breaking strength of the products of the different mills the variance ratio is formed as F= `g = 0.78. Since this is less than 3.81 which is

the tabulated value of F for 2 and 13 degrees of freedom at 5 percent level,

Is : 7699 - 1975 Therefore, it may there is insufficient evidence to reject the hypothesis. be concluded that there is no appreciable variation in the breaking strength of the products of the three mills. following data are from 5 replicate runs on the 4.1.5 Example 2 -The time of passage of aluminium powder through 6 different test units. It is endeavoured to test whether there is any significant difference between the 6 test units when compared with the variation within units :
RUtZ
: : 5

Unit-t

1

2

3

4

5

6

52.9 52.3 52.2 52.5 52.7

54.0 53.8 53.8 53.6 53.6

53.2 52.6 53.4 53.4 53.0

50.8 50.5 50.8 50.7 50.5

54.6 5414 ::: 54.4

54.0 53.3 53.5 53.7 53.7

The data can be coded by subtracting 50 from each value and multiplying the remainder by 10 to remove the decimal. Coding by adding or subtracting a constant has no effect eon the calculation of the corrected sum of squares or the variance. Coding by multiplying by a constant will change the resulting variance by the square of this constant. However, since the mean squares are tested by a ratio of two calculations, the coding factor will cancel. If ultimately it is intended to use the calculated variances, they will have to be decoded by dividing by 100. The coded data and the analysis of variance are as follows : RUG
: 3 5 4 Total Mean 27 $3 126 25.2 Unit-+ 1 :z 2 .fi 38 36 188 37.6 3 z; 34 :: 156 31.2 5 ;: 33 6.6 44 ii 224 44.8 4 s 5 :Fi 6 40 z: :r; 182 36.4 909

-

The various calculations are obtained as below : 4 Uncorrected totalsum of squares : (!$)st1$23)2+.
'

. . . + (35)2+ (37)s

b) 4 4 4

Uncorrected Correction

sum of squares between units : & [ ( 126)2 + (1 88)2 + . . . . (182)2]=31 989 factor : q&7 542.7

Hence the corrected sum of squares are obtained as : Total sum of squares: (a)-(c)=32 119-27 542.7=4 576.3 Sum ofsquares between units : (b)-(c)=31 989-27542.7=4446.3 8
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f) Sum of squares within units : (d) -(e) =4 576.3-4 446.3 = 130.0 The degrees of freedom for the various entries of the analysis of variance table are obtained as follows : Total sum of squares : 30- 1=29 Between units : 6-l= 5 Within units : 29-5=24
ANALYSIS OF VARIANCETABLE Source of Variation D.F. Sum of Squares 4 446.3 130.0 4 576.3 Mean Square 889.26 5.42 F Ratio

_
Between units Within units Total **Highly significant. 5 24 29 88926 = 164.1** 5.42

Tabulated F value for 5 and 24 degrees of freedom at 0.01 level is only 3.90. The Fvalue as obtained from the data is highly significant. Hence there is a strong evidence of a factor between units which causes a variation in the results greater than that which can be accounted for by variation within units. It may now be desirable to compare all possible pairs of means to find out which of the means or how many of them differ significantly from the others to cause the overall variation among means to be significant. For this purpose use is made of t-test for difference between means, taken two at a n(n- 1) time. With n means, there are 2 comparisons (number of combinations of n things taken two at a time). 6x5 o = 15 comparisons. The residual variance gives a measure of the precision of a single This measure is identical with the pooled estimate measurement. of the standard deviation. In the present example it is obtained as $$@=2*327. Standard deviation of the mean of five measurements is 1/5 times the standard deviation of single measurement and is equal to In the present example there will be

The critical difference, useful for Cnding out whether any two means are significantly different or not, is obtained as t,, x fix 1.040 6 where tap is the tabulated value of the t distribution for 24 degree of freedom at the 5 percent level, which is obtained as 2.064. The critical difference is thus obtained 9

as ~3.04. It may, however, be noted that the use of critical difference is valid only when the F ratio in the analysis of variance table is found to be sign&apt. After obtaining the critical difference the various means are arranged either in the ascending nor the descending order of magnitude as follows :
Unit No.

4 6.6

1 25.2

3 31-2

6 36.4

2 37.6

5 44-8

Means

Any two means not under scored by the same line are significantly different. Thus in the present example only the means between unit 6 and unit 2 are not different. Hence it may be concluded that a11 other units give results which are significantly different from one another. 4.2 Single Classification with Subgrouping (Nesting)

4.2.1 Often the principal classification in the data can be divided into sub-classifications which do not cut across the main classes. Each sample may be composed of sub-samples and these, in turn, may consist of sub-subsamples. The repeated sampling and sub-sampling give rise to nested sampling or hierarchical classification. For example if we draw 10 bales of wool at random from a shipment and take three cores of wool from each bale, the cores are said to be nested within bales. Again several analysts might draw two, three or four specimens (not necessarily the same number for each analyst) from a batch of material and run several replicate analyses on each specimen.. The replicate analyses give an estimate of the error variance. The specimens run by the same analyst give a measure of variation within the batch plus the error. The differences between the results of the several analysts include not only the variation between analysts but also the variation between specimens and the error variation. Same situation holds good if several test pieces are cut from each of several rolls of fabric made wholly from several different machines from each of several plants. As long as there is no relation between the corresponding members of the different groupings, the analysis of variance by subgroups (or nesting, as it is sometimes called), applies. Where there is relationship between the members of the subgroups, so that the corresponding division of each subgroup can be considered as a separate class, we have an analysis of variance for more than one main classification. The general arrangement of data for single classification with several hierarchies of subgrouping is illustrated by the following example. 4.2.2 Example 3 - A series of trials is made by three operators to locate a source of variation in a chemical analysis. The procedure consists in taking a specimen, treating it in a combustion-tube furnace~and performing the chemical analysis. In the test, three operators each took two specimens and made three combustion trials on each specimen and titrated each trial in duplicate. The single letter A represents the operator factor, the double 10
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letter B(A) indicates the first or second specimen taken by the operator, and the triple letter C(AB) indicates the .combustion trial on each specimen by each operator. The results are shown in duplicate. Uncorrected sums of squares are obtained as follows : (156)s+(151)2+. . . . +(184)2+(177)2 4 Total sum of -squares: =~817 085 b) Combustion within operators and specimens C(AB) : (310)s+(305)s+ . . . . +(3W2+(3W2 = 816 778 2 Specimen within operator B(A) : (929)2+(905)2+. . .+(I O98)2+(1 065)2 = 815 2og 6 Between owerator : (1 834)2'+(1 318)2+(2 163)2 = 8!4 g37 12 Correction factor : (5 36315)2 = 784 700
ANALYSISOF VARIANCE TABLE

Source of Variation
Operators A Specimen B(A) within operators

D.F.
2 3 12 18 35

Sum of Squares
814937-784700=30237 815209-814937= 816 778-815 209= 272 1569 307

Mean Sguare
15 168 91 131 17 131

F Ratio
15 168 -= 131 116+*

Combustion within operators & specimens C(AB) Replicates, error Total **Highly significant.

17=
(By subtraction) 817085-784700=32385

7*7**

From the analysis of variance table it may be concluded that the largest There is no evidence of source of variation in results is between operators. variation between specimens. There is a definite source of variation in combustion step in analysis. 5. ANALYSIS OF VARIANCE - TWO-WAY

CLASSIFICATION

OF

DATA
pressures or if material from several sources of supply is tested under a variety of conditions or if a group of operators makes a series of runs on a

5.1 If a series of experiments is run at different temperatures and different

IS : 7690 - 1975 number of pilot plants, two-way classification of data is obtained. In such a case, one of the characteristics can be represented along the rows and the other along the columns. Further, each of the cells formed by the two-way classification can have either one or more than one observation. 5.2 Two-Way Classification with One Observation in Each Cell 5.2.1 The analysis of variance for this arrangement of data is similar to that for the single factor arrangement. The column-factor effect is calculated from the squares of column totals and the row factor effect, from the squares of row-totals. The following example illustrates the method of analysis. 5.2.2 Example 4 - Six samples of dextrose monohydrate were analysed in each of the seven laboratories for copper content (measured as ppm). The data obtained by the investigation is given below :
Total

-z@c'

2

3

4

5

6

5 6 7 Total

8:; 0.4 0.5 4.0

8:; ;:; 4.8

0.2 ;:; 0.1 2.0

8:; 0.4 1.0 3.3

;:; 0.6 1.2 3.8

0.3 I:; 1.2 4.9

;:`3 ;:A 4.6 22.8

The uncorrected

sum of squares are calculated as follows : a) Total sum of squares : (@3)2+d@2)2+ . . . .+(1.2)2=17.56 b) Between laboratories : (22)2 +(3*5)2 . . + (46)2-13.90 -+.. -sum of squares 6 : (4;)s +(4$ c) Between samples T+....+v=13.20 sum of squares Correction factor : (22'8)p_ - 12.33 42

Analysis of variance

table is formed as follows :
ANALYSIS OFVARIANCE TABLE F Ratio 0.25 -= OS.09 0.16 -= 0.09

Sourceof Variation Between rows (laboratories) Between columns (samples) Error Total

D.F. 6 5 30 41

Sum of Squares 13%-12.38= I.52

Mean Square 0.25 0.16 0.09

2.7$ 1.78

13.20-12.38=0.82 (by subtraction) 2.84 17*56-12.38=5.18

13

IS : 7600 - 1975 From the analysis of variance table, it can be concluded that there is no significant difference between either the laboratories means or the sample means. 5.3 Two-Way Classifkation with Multiple Observations in Each Cell - In this type of classification more than one observation is obtained in each cell formed by the rows in the columns, that is, repeated measurements are made on different randomly selected individuals. 5.31 Replication facilitates fuller analysis of the data and the precision of the experiment increases with replication. 5.3.2 In the two-way classification with more than one observation per cell, besides the variation due to row and column effects there would also be an interaction effect which is the result of different row column combinations. As all the different combinations from the groups of the two factors of classification play their part in the experiment, they also contribute to the total variability. This interaction becomes one of the sources of variabiSupposing lity which must be taken into account in the analysis of variance. the criteria of classification are different varieties of material subjected to different treatments. In such a situation the same variety can be differently affected by different treatments and the same treatment can show different effects with different `varieties. This combination effect is the interaction. The following example illustrates the method of analysis. 5.3.3 Example 5For studying the accuracy of water meters for continuous rate of flow, 5 water meters were tested by each of the two operators A and B. Each operator made five repeat observations on each meter. The observations in terms of the percentage accuracy of the meters are given below : operator
,--__A1 A 2 -1 -1 -1 Cell total B - 3.5 --4 z: Cell total Total ;5 y-5 2 Water Meter JVumbn 3 + -----T I 33.5 -3 1; - 15.5 1; 1 i.5 -3 - 16.5 -32.0 -44.0 -78.5 - 34.5 Total

1
-3

23.5 z `3:;
7 1.5 z.5 - 12.0 1; ;`; 2' 1; -1

-1
- 13.5 1; 7 -

2'

z
2 IO.0 --0 8 2" 2.0 12.0

1.5 7.0

-12.5 -26.0

-10.0
-22,o

-10.5

14
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From the above table the various sums of squares are obtained as follows : 4 Uncorrected total sum of squares= (0*5)s+ (--I)*+ * y-;r Correction factor (CF) = (-`8*5Y=1*3.24 Hence the corrected total su&?f squares=270.75-123.24=147.51 (-10*5)`+ lo . . . . (-32.0)s 1. _ CF

bZ Between

meters sum of squares =

= 243.82 - 123.24 = 120.58

4 4

Between operators sum squares = Interaction (-3.5)s

(-34.5)s +(-44.0)s 25 = 125*05-123.24%81 meters sum of

_ CF

=T+....P$-

sum of squares - s CF-between

squares-between operators sum of squares = 252~05-123~24-120~58-1~81 = 6.42 The analysis of variance table is then formed as follows :
Source of Variation Sum of Squares 120.58 Mean Square 30.14 F Ratio

Between meters (M)

30.14 16o= 1.81 160= 1.60 oT =

18.84** 1.13 3,40*

Between operator (0) Interaction Error Total **Highly significant. *Significant. (M
x

1 4 40 49

1.81 642 18.70 147.51

1.81 1.60 0.47

0)

When the interaction is tested against the error it is found to be highly significant thereby indicating the presence of interaction. It may hence be interpreted that different water meters behave differently with the change of operator. In view of this finding, there is an urgent need for the procedure for testing of the water meters to be standardized. Since the interaction is significant, under the random effects model chosen for the experiment, meters and operators are to be tested against the interaction. This testing reveals that between operators variation is not significant whereas between 15

IS : 7600 - 1975 meters variation is highly significant. The latter finding is perhaps to be expected, since the meters had originated from different manufacturers. 5.4 Two-Way Replication) Classification with Sub-grouping (Nesting and

5.4.1 Sub-grouping m nesting of data within a main classification can occur with data that are collected under two main classifications. For example, if several factories are producing the same product on a batch basis and each makes several hatches at two or more conditions then a situation illustrated by nesting is obtained. 5.4.2 The batches from each factory are not related to batches from other factories and for the purpose of analysis of variance they are simply subgroups of the factory classification. Any variation between factories would include the variation between batches within factories. The condition factor is an independent classification and its effect on the variation of results is reflected in the difference between factories, that is to say, only in so far as there is interaction between the conditions and factory factors or the condition and batch factors. 5.4.3 Sub-grouping can exist under either factor or both the factors in the same set of data: Secondary sub-grouping can exist within the first sub-groups. In fact, there can be any hierarchy of sub-groupings under both main classes of factors. An illustrative example is given wherein subgrouping occurs under both the factors. 5.4.4 Example 6-For studying the effect of storage time and packing on the moisture content of corn flakes, two types of packings namely, polythene bags and polythene bags in cartons were chosen and the periods selected were 2,4 and 6 months. Six different samples of corn flakes belonging to the same batch of manufacture were analysed in duplicate after storing them for 2, 4 and 6 months in the two types of packings. The resultant data is given below :
Storage Period 2 months Cell Total 4 months Cell Total 6 months Cell Total Sample 1 6.17 6.11 Packing Type I Sample 2 4.40 4.72 Sample 3 4.22 3.80

I
Sample 1 4.66 5.00 9.66 4.52 4.43 --iG 6.89 6.74 13.63

Packing Type 2 Sample 2 3.98 4.00 7.98 4.56 3.17 7.73 7.12 7.06 14.18 5.88 4.83 10.71 6.55 6.65 13.20 Sample 3 6.22 540

-

-

-

12.28 5.50 5.48 10.98 6.10 6.13 12.23

-

9.12 4.46 4.85 9.31 5.80 5.73 11.53

8.02 4.61 4.25 8.86 6.79 7.73 14.52

-

16
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From the above data a sub-table of the following type is formed to assist in the computations.
SUB-TABLE

Storage Period 2 months
4 months 6 months Total

----7

Packing
Type 2 29.26 27.39 41.01 97066

Total

Type 1 29.42 29.15 38.28 96.85

5868 56.54 79.29 194.51

The various sum of squares are then obtained as follows : a) Uncorrected total sum of squares= (6.1 7)2+. . (6~65)~= 1 095.181 9 Correction factor (CF) = (194*51)2 = 1 050.948 3 1095.1819(96.85)2+ = l8

1050.948 3=44.233 6 CF (97.66)2 -b) Between packing sum of-squares 18 = 1050.9666-l 050.9483=0.0183 CF (58.68)2+ (56.54)"+ (79.29)s c) Between periods sum of squares =12 12 --E--= 1077.2515-i 050.9483z26.3032 d) Interaction (packing x period) sum of squares __(29.42)2 I . . . . -(41.01)s -CFbetween packing sum of squares 6 6 -between period sum of squares = 1078.13281050.9483-0.0183-.26.3032 =0.863 0 Corrected total sum of squa?z= me) Between samples (within packing and period) sum of squares (29.42)2+ (41.01)s (13.20)2 = (12.28)2+ . . ..--. . . .6 6 2 2 [ 1 092.521 4- 1 078.132 8=14988 6

=

11

1

The analysis of variance table is then formed as follows : From the analysis of variance table, between sample sum of squares (within packing and period) is tested agamst error and the F ratio so obtained Because of this fact both the main effects due to packis highly significant. ing and period as also the interaction are tested against between sample (with.in packing and period). The testing reveals that the mean value of moisture content for between periods is highly significant, corroborating the general presumption that corn flakes gather moisture depending on the period of storage. The analysis also reveals that there is no significant 17
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: 7600 - 1975

difference packings.

in the moisture content

of corn flakes stored in two different

ANALYSISOF VARIANCE TABLE Source of Variation

Sum of squares
1 2 0.018 3 26.303 2

Mean Square
0.018 3

F Ratio

Between packing Between periods Packing period Between and period) samples (within packing Error

-= 0.018 3 1.199 0

0.002

13.151 6 0.431 5
1.199 0 0.147 8

-13.157 6 = 10*g7** 1.199 0 zo=0.36
__!z%*.* 0.147 8 1++

2
12 18

0.863 0
14.388 6 2.660 5 (by subtraction) 44.233 6

Total **Highly significant.

35

5.5 Three-Way Classification - If there are three different sources of variation (factors of classification) which are likely to act independently of one another and contribute to the total variability of the data, then the total sum of squares has to be split up into three components due to these sources. The analysis of variance follows the same formulation as that of a two factor arrangement. When more than two factors are involved, all possible combinations of interactions may exist and a complete analysis of variance provides mean squares attributable to all the main factors and-all individual interactions. 5.6 Factorial Experiments - When the effect of several variables on a product or process is of interest, it is possible to devise experiments where For each variable, a number of all of them may be studied simultaneously. categories or levels may be chosen for study. If an equal number ~of observations is made for all possible combinations of levels (one level from each variable), the experiment is called factorial. In a completely balanced experiment, each level of each factor is tested at all the levels of all the other factors so that the total number of observations required is the product of all the levels and all the factors. Thus in a factorial experiment to study the wear resistance of vulcanized rubber wherein five qualities of filler (factor A), three methods of pretreatment of the rubber (factor B) and four qualities of raw rubber (factor C) are involved, the total number of observations for a When more than two complete experiment turns out to be 5 x 3 x 4=60. factors are involved, all the possible combinations of interactions may exist and a complete analysis of variance provides mean squares attributable to all individual interactions. 18

IS : 7600 - 1975 6. CLASSIFICATION OF HIGHER ORDER

6.1 There is no limit to the number of main factors like different treatments, materials, laboratories, temperatures, pressures, catalysts or concentrations, that may be examined in the same experiment. A suitable design of experiment is a pre-requisite in such studies.

APPENDIX (Clause3.1)
FIXED, RANDOM

A
MODELS

AND MIXED

If XtJk is the dimension of kth component produced on the ith machine onjth day in a plant, then it may be written that

where P ai bj Clj
eilk

= overall mean dimension, = effect due to ith machine; = effect due to jth day, = effect due to interaction of ith machine and jth day, and random effects which are independently normally distri= buted with mean 0 and variance 0,s.

In the random effect model where the machines under study are considered as a random sample from a large number of machine as also the days are the randomly chosen ones, ar, bi, and cij are all assumed to be independently normally distributed with 0 means and respective variances ups, 02 and u*a2. In the fixed effects model where the conclusions are to be drawn only on the few machines that are under study and on the specific days chosen in the experiment, it is assumed that : ai=? 1 bj=: cif=+ 1 cij=O

In a mixed effect model where the machines are considered as a random sample from a large number of machines, but the days are those specifically chosen, ai and crj are assumed to be independently normally distributed with 0 mean and variances a~2 and u.& and .Z 6~=J2 ctj=O.
j J
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